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Abstract 

We study the scaling of the convergence of several statistical properties of a recently introduced 
random unitary circuit ensemble towards their limits given by the circular unitary ensemble (CUE). 
Our study includes the full distribution of the absolute square of a matrix element, moments of that 
distribution up to order eight, as well as correlators containing up to 16 matrix elements in a given 
column of the unitary matrices. Our numerical scaling analysis shows that all of these quantities 
can be reproduced efficiently, with a number of random gates which scales at most as nq\og{nq/ e) 
with the number of qubits Uq for a given fixed precision e. This suggests that quantities which 
require an exponentially large number of gates are of more complex nature. 
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I. INTRODUCTION 



Random unitary matrices play an important role in many tasks of quantum information 
processing, including quantum data hiding Q], quantum state distinction Ql quantum 



encryption [3|], superdense coding of quantum states and noise estimation [5|. In these 
applications, a random ensemble of N x N matrices U drawn uniformly from the Haar 
measure of the unitary group, the so-called circular unitary ensemble (CUE), is required 
6|. In principle, any unitary matrix acting on vectors in the Hilbert space of dimension 
= 2^'' of Ug qubits can be approximated with arbitrary precision using a computatio nally 
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universal set of quantum gates that act on one or two qubits at the time [3, ISl, l9|, [lO|, • 
However, as simple parameter counting quickly confirms, the required number of quantum 
gates Ug grows typically exponentially with the number of qubits. Indeed, OlN"^ {In N)^) 
gates are required to approximate all matrix elements of U using a fixed universal gate 



set [1^. This makes the construction of sets of random unitary matrices which are evenly 
distributed according to the Haar measure of the unitary group highly inefficient. One 
explicit but inefficient procedure of constructing matrices drawn from CUE is based on the 
Hurwitz parametrization (see [3]). 

In a seminal paper, Emerson et al. introduced the concept of pseudo-random unitary 
operators, i. e. random unitary operators which are drawn from a distribution that mimics 
a uniform distribution with respect to the Haar measure of the unitary group [l^. The 
construction of these operators was motivated by ideas from quantum chaos, and used a 
random quantum circuit consisting of random U{2) rotations on each qubit followed by 
two qubit-gates that implement an Ising spin interaction between nearest neighbors. They 
showed that this circuit produced unitary matrices with a distribution of matrix elements 
which converges exponentially with the number of quantum gates to the well-known distri- 
bution of matrix elements of CUE [l5|]. Later, Emerson, Livine, and Lloyd showed that the 
joint distribution function of matrix elements of a product of unitary operators created by a 
random quantum circuit composed of a continuous or discrete universal gate set converges 
uniformly and exponentially with the number of quantum gates to the Haar measure on the 
unitary group, albeit with a rate which itself decreases exponentially with the number of 
qubits 51]. This left open the question of the efficiency of the creation of the pseudo-random 
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unitary operators in the sense of the scahng of the number of gate operations with the num- 
ber of qubits. Furthermore, the distribution Pij{Uij) of matrix elements Uij contains only 
a small amount of information compared to the full joint distribution of matrix elements. 
Notably it is unclear how fast correlators of matrix elements would converge to the CUE 
values. 

Quite different statistics have been studied so far for different random circuit ensembles. 



In 



Iq the question of the efficient generation of typical bipartite entanglement between two 



subsystems was addressed numerically for a quantum circuit composed of U (4) gates, each 
of which was a product of a fixed two-qubit gate and two random single qubit gates drawn 
uniformly from the Haar measure of U{2). Exponential convergence to the CUE value with 
a rate that depends as Ug Inn^ on the number of qubits was found. Oliveira et al. introduced 
the technique of Markov chain analysis to study the same question and were able to prove 
an upper bound of 0{ng) quantum gates necessary to reach a given (absolute) precision e 
for the average amount of bipartite entanglement [171, llSj]. Average gate fidelity was studied 
in . The distributions of differences between nearest neighbor eigenphases as well as the 
distribution of the amount of interference was studied in 20| for the same random unitary 
circuit ensemble as the one we will use here (see below). Exponential or even Gaussian 
convergence was observed, but the question of the efficiency remained open. 

The study of pseudo-random unitaries is closely related to the theory of unitary fc-designs. 
Dankert et al. defined a unitary /c-design as a discrete set of unitary matrices such that the 
average of any polynomial of degree equal or smaller than k in the complex matrix elements 
of U over the set equals the average of that polynomial over the unitary group 21|. Harrow 
et al. showed that a random circuit of length polynomial in Ug yields an e-approximate 
2-design. Depending on the gate set used, the number of gates Ug needed to achieve a 
given precision e scales as Oijiqiriq + logl/e)) or as 0{nq\n.{nq/ e)). They also conjectured 
that a random circuit on qubits composed of poly(na^) random two qubit gates chosen 
from a universal gate set is an e-approximate /c-design 22|]. Originally unitary designs were 
defined for a fixed set of unitary matrices, each of which comes with the same weight (see 



23( 1 for an insightful discussion of their mathematical structure). The definition in 
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naturally extends the concept of unitary designs to probability distributions over sets of 
unitary matrices, such that each random unitary matrix corresponds to a realization of the 
random quantum circuit, and averaging over the random circuits realizes the average over 
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the unitary design. 
The results from 



22| imply that random quantum circuits can efficiently (with 



f If/jjP, \UiA'^ and If/iit/fr/P- Note that 



0{ng\n{ng/e)) gates) reproduce the CUE averages o: 

CUE averages of unpaired matrix elements vanish j24|]. Since entanglement fidelity and gate 
fidelity can be expressed as averages over polynomials of order (2,2) in Uij and f/^^, this 
result con&ms the numerical finding in 

In this paper we study numerically the efficiency with which the random unitary circuit 
ensemble (UCE) introduced in [2^ (see below for its definition) reproduces various statistical 
properties of the matrix elements of CUE matrices. Our study includes the full distribution 
of the (absolute square of) matrix elements, moments of that distribution up to order |f/jj|^^, 
as well as correlators containing containing up to 16 matrix elements of a given column of 
the unitary matrix. Within the range of numerically accessible sizes of the quantum circuits 
(up to 28 qubits for the distribution of a single matrix element, down to 15 qubits for the 
correlators), our results show that, surprisingly, the number of gates required to reach a 
given precision e for all of these quantities grows no faster than rig In (rig /e), indicating that 
the statistical properties of CUE which require an exponential number of quantum gates in 
order to be well approximated by a random quantum circuit, must be of a more sophisticated 
nature. 



II. CONVERGENCE OF UCE TO CUE 

The unitary circuit ensemble (UCE) introduced in [2^ consists of quantum algorithms 
which use two kinds of quantum gates: U{2) gates which act on single qubits, and the 
CNOT gate which acts on two qubits at the time. Each algorithm is built from a random 
sequence of these gates, where the probability that a given gate is a 1-qubit gate is Pg and 
the probability that it is a 2-qubit gate is 1 — Pg. We set Pg = 0.5 throughout this paper. 
The choice of the qubit(s) on which a gate acts, is made uniformly and independently for 
different gates over all the qubits. Fig ([1]) shows an example of this kind of algorithm for 
3 qubits and 5 gates. The U{2) gates are chosen uniformly with respect to the invariant 
Haar measure of the U(2) group. They can be parametrized with four angles a, tp, x chosen 
randomly and uniformly from [0,27r[, and = arcsin(^^/^) with ^ picked randomly and 
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FIG. 1: A random UCE circuit. The two different angles 6 and 0' mean two different random U(2) 
gates. 
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where we have abbreviated = (a, ip, Xi f) [3- The phases a only modify the global phase 
of the algorithm and are irrelevant for the statistical properties that we are going to study. 
From the results of 5|] it is clear that in the limit of the number of gates — > oo and fixed 
Uq, UCE converges to CUE. 

The UCE gate set might be summarized as T = {{^^,f/2 ® 12},{^^,12 ® 
f/2}, f/cNOTi,2}, {^7 f^CNOT2,i}}, where the first number in each pair in the hst is the 
probability that the second member of the pair will be selected in any step of the algorithm, 
/i(f/2) nieans the Haar measure of U{2), and ?7cNOTi,j is a controUed-NOT gate with control 
qubit i and target qubit j. It is easily checked that F is a "2-copy gapped gate set" in the 
terminology of [22]. This means that the operator G = /j^j.^^ U ®U ®U* ®U*dfir{U), defined 
for a general gate set distributed continuously over f/(4) with measure ^r{U), has only two 
eigenvalues with absolute value equal to 1. The difference between this largest degenerate 
eigenvalue 1 and the next smaller eigenvalue (in terms of its absolute value) is called the 
spectral gap A. Our gate set F has spectral gap A ~ 0.232703, if the gates are represented 
as 4 X 4 matrices. The gap is expected to decay as l/n^ if the gates are represented as 
matrices of size 2"', but will be finite for any finite Uq 251 ]. 



A. Distribution of matrix elements 

The uniform distribution of CUE matrices of size with respect to the Haar mea- 
sure of the unitary group U{N) yields a specific joint probability distribution P{U) = 
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P{Uii,Ui2, ■■■■jUnn) of the matrix elements Uij which entirely defines this ensemble. Con- 
vergence of UCE to CUE means that the joint probability distribution P{U) associated with 
UCE converges to P{U). However, a direct numerical study of the joint distribution is im- 
practical since the number of necessary realizations grows exponentially with the number of 
independent arguments of P{U). More practical quantities can be obtained from the joint 
probability distribution by integrating out several variables. A natural quantity to consider 
is the distribution of matrix elements, which depends on one complex variable, and which 
is obtained by integrating out the other A^^ — 1 complex parameters, 

P{U,,) =[■■■[ n dUkiP{U). (2) 

The first quantity we study in this paper, closely related to P{Uij), is the distribution 
of quantities defined by hj = \n{N\Uij\'^). For CUE, one shows in random matrix theory 
(RMT) that all lij are distributed according to the normalized distribution 



independently of the choice of the index of l^j 15 1. 



For UCE, the distribution of matrix elements is not independent of the elements chosen 
as long as the number of gates Ug is small, but becomes uniform over the matrix in the limit 
Ug ^ oo. For numerical efficiency, we have made two simplifications: 

First, we produce and propagate only the first column of the matrix. This obviously reduces 
drastically the memory requirement, and moreover, the action of a CNOT gate on this vector 
requires only the manipulation of a subset of the matrix elements. With the binary notation 
of the row index z of a matrix element Un, i = 1 + XloL 

CNOT between qubits k 

(control) and / (target) in [1, Uq] requires only the exchange of the 2"«~^ elements in positions 
where (cTfc = 0,ai = 1) with the 2"9~^ elements in positions where {ak = l,ai = 1). For the 
f/(2) gates, each element in the new column is a linear combination of two old elements, 
with c, s, — s, or c as coefficients. 

Secondly, we define P{1) by averaging both over the realizations ((. . .)/j) and the elements 
in the 1"** column {{. . .)c) , 

m = ^ (CHl))c)n (4) 

r=l i=l 
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where ^(^j-i"*) is the histogram for the i^^ component in the first column of the r^^ matrix. In 
order to obtain good statistics, it is important to produce a large enough number of matrices 
rir- However, for a given number of matrices, when one adds one qubit, the calculation time 
roughly doubles, because the size of the Hilbert space is doubled. For this reason, as long 
as Uq < 20, we considered an ensemble of rir = a2^~'^'' matrices (with a and b integers). 
In other words, when adding a qubit, the increase of the size of the Hilbert space by a 
factor 2 is compensated by reducing the size of the ensemble by a factor 2, without a loss 
of statistics. We choose a = 10 and 6 = 20 leading to a total of about 10'' matrix elements. 
Due to the correlations between the matrix elements (see below), averaging over a column is 
not quite as effective as averaging over realizations, and therefore the noise of the numerical 
data increases with growing Uq. For more than 20 qubits, numerical run time limitations 
forced us to fix the number of realizations to rir = 10. 
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FIG. 2: Convergence of P{1) to P{1) (dashed line) for 4 qubits with Ug =5, 10, 20 and 50 for an 
ensemble of 10^ matrices 



Fig. [2] shows, for Ug = 4, the convergence of P{1) to P{1) with increasing Ug. To quantify 
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FIG. 3: The distance Dp{ng) between the distributions P{1) and P[l) as function of the number 
of gates Ug for = 2, 3, . . . , 28 qubits (from left to right). 

the scaling of the convergence with the number of qubits, we define the quantity 



which represents a distance between the square roots of the UCE and CUE distributions. 
This distance goes to zero as UCE converges toward CUE for Ug ^ oo. Using the square 
roots rather than the distributions themselves is motivated by the fact that Dp is bounded 
from above by the value two, which simplifies the scaling analysis. Fig. [3] shows the behavior 
ol Dp as function of Ug for = 2, 3, . . . , 28. As expected, this quantity decays rapidly when 
the number of gates Ug grows, but the decay slows down with increasing Ug. Since our 
numerical calculations use a finite number of realizations, P{1) fluctuates about P{1). The 
distance Dp can therefore never vanish exactly, and we observe that it saturates for large 
Ug at a finite level dmin which depends on Uq. The level of saturation can be reduced 
by increasing n^. When Dp saturates, our ensemble becomes indistinguishable from CUE 
within the precision of the numerics. We have fitted Dp for each value of Ug and we observed 
that when Ug is small {ug < 12), Dp is well fitted by 2e~""« whereas for larger values of 
Ug, Dp has a pronounced quadratic component in its exponent (Dp ^ 2 e~""''~'^"«). The 
change in the functional dependence of Dp on Ug makes it difficult to determine the scaling 
of the rate of convergence with Ug. We have therefore preferred to base our analysis on the 
number of gates n* needed to achieve a fixed small value e of Dp for a given number of 




8 



400 
300 
n* 200 
100 





400 



300- 



n* 200 



100- 



5 10 15 20 25 30 




400 



300- 



n* 200 



100- 




15000 



10000 



5000 




FIG. 4: (Color online) The number of gates n* needed to achieve Dp < e for ln(e) =0,-1,-2,-3,-4 
and -5 (*, -|-, A, o, □, o respectively) and Uq = 2 . . . 28. Straight lines are fits to the functions 
/i, /2 and /s (1st, 2nd and 3rd plot respectively). The last plot shows ^ov these fits (/i (o), /2 
(□)), and h (o). 
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FIG. 5: (Color online) The coefficients ai (green squares) and 02 (red circles) as a function of 
ln(l/6). 

qubits. Figure m shows the behavior of n* as function of Uq for six different values of e (ln(e) 
between -5 and 0). We have fitted n*{nq) with three different 2-parameters functions, 

h = aiUq + bi, (5) 
/2 = 02 Uq ln(ng/ e) + 62 , (6) 
h = a3nq{nq + ln{l/e)) + b3, (7) 
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The linear function /i is an obvious choice given the appearance of the numerical data. 
The functions /2 and /s are motivated by the results in [22] on 2-designs. These authors 
defined the convergence of unitary /c-designs by the action on a test density matrix p of 
dimension kT^'' . As measure of distance, they consider the completely bounded ( "diamond" ) 
norm of the difference between the state Qw{p) = '^iPiUf'^ piU})®^ propagated by the k- 
design and Qh{p) = Ju U®^ p{U'^)®^ resulting from the propagator averaged over the unitary 
group. The gate set F = {{pi, Ui}} of unitary matrices Ui together with their probabilities 
Pi need to form a 2-copy gapped gate set. They show that a random quantum circuit of 
length Hg drawn from a 2-copy gapped gate set is an e-approximate unitary 2-design if 

> C {nqijiq + Xogil / e))) with some positive constant C which may depend on the gate set. 
In the special case of a gate set drawn uniformly from t/(4), which has maximum spectral 
gap A = 1 (i.e. G is a projector), it was found that an e-approximate unitary 2-design is 
already reached for > Cnqlogijiq/ e)). 

As mentioned, o ur g ate set V is indeed two-copy gapped, with spectral gap A ~ 0.232703, 
and the results of 22] do therefore apply to the convergence of UCE to CUE. While one 
should be cautious in directly comparing these results, which constitute an upper bound, 
and are based on the propagation of a trial state and the use of the diamond norm with our 
results which use Dp as a measure of distance, it seems plausible that the convergence of 
the distribution of matrix elements of the propagator should be related to the convergence 
of a propagated test state (see also 2^, where an efficient quantum algorithm for twirling 
was introduced). Based on the above cited results of 22] the function /s would therefore be 
the most natural candidate for a fit of n*{nq). However, it turns out that the function /2, 
even though relevant a priori for spectral gap A = 1, fits our numerical data much better, 
i.e. in what concerns the distribution of matrix elements, UCE converges to CUE much more 
rapidly than expected from the lower bound on Ug mentioned. 

The quality of the fits is measured by x^, the sum of squares of deviations (see Fig. Hj). 
We see that the simple linear behavior /i fits better than the quadratic form despite a slight 
upwards curvature of the curves n*{nq). That curvature is well captured by the Uqlnriq 
behavior of /2, whereas the quadratic behavior of fits much worse. The function /2 gives 
in addition the correct e-dependence, i.e. a coefficient 02 — 0.2 which is basically independent 
of e (see Fig. [5]). 

A clear distinction between /i and /2 is not possible based on the numerical data, as 
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both fit very well in the limited range of rig available. Nevertheless, our numerical results 
clearly indicate that, concerning the distribution of matrix elements, CUE can be efficiently 
simulated by UCE, in the sense that the number of gates used to achieve a given level of 
accuracy e grows only like riq Inijiq/ e) with the number of qubits, and in any case more slowly 
than n^. This is rather surprising, as P{1) contains information about all moments, and one 
is therefore led to the conclusion that no moment of appreciable weight in the reconstruction 
of the distribution should need more than 0{nq \n.{nq/ e)) gates before coming within distance 
e relative to the CUE value. In order to confirm this hypothesis, we have studied several 
k-ih moments directly. 



B. Moments of the distribution of matrix elements 



The /c-th moment /i^ of the distribution of matrix elements is defined as Hk = {y ) 



N''{\Uij\'^''). Invariant integration |2J] leads for CUE to 



k\N''{N-iy. 

(N + k-iy. ' 

which tends to /c ! for ^ oo and k fixed. For UCE we average again over both random 
realizations and elements in the first column of U, analogously to (jlj), and define the /c-th 
moment as 

~^^ = ^j:j^j:{y'i) =iiy')c)n- (9) 

where y^[^ is equal to A^|f/(/-[^)p for the i*^^ component in the first column of the r*'* matrix. 
As measure of the deviation from the CUE result we use the relative deviations 

^ _ l/ifc -/ifcl 

We have calculated D^^^ for k=2, 4, and 8. For the latter two cases, we used Ur = 10^ for 
Ug = 2, . . . , 14 and = 5 ■ 10^ for 15 qubits. Fig. [6] shows the behavior of D^^i'^^g) 
Uq = 2, ... ,18. The curves for and i^^g look very similar, with the exception of a higher 
saturation level, and are not shown. 

In Fig. [7] we plot the number n* needed to achieve a fixed small value -D^^. < e for all 
three moments studied, k = 2,4,8, and for different values of ln(e), together with fits to 
the function /i introduced above. The similarity between the three curves is striking. The 
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Figure demonstrates that n*{nq) is very well described by a linear behavior for all fc, and 
that furthermore even the slopes of that linear behavior are very similar for all moments. 
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FIG. 6: Dfj_,^{ng) for a number of qubits Uq varying between 2 and 18 (from left to right). 




FIG. 7: (Color online) The number of gates n* needed to achieve Z?^^. < e as function of the 
number of qubits together with fits to the function /i. Plots 1, 2, and 3 correspond to k = 2,4,8, 
respectively. The different symbols in a plot (A, o, □, o) represent different values of e, with 
ln(e) =0.5, -1.5, -2.5 and -3.5 for k = 2, ln(e) =-1, -2, -3 and -4 for k = A, and ln(e) =1, 0, -1 and 
-2, respectively for k = 8. 

We have also fitted the data to /2, but our range of Ug is too small to decide which of the two 
functions /i and /2 describes the scaling with Ug better. In fact, the numerical data n*{ng) 
for and show a slight negative curvature, which makes /2 nominally fit worse than 
fi. However, we believe that the slight negative curvature is a numerical artifact explained 
below, and secondly, /2 also represents the dependence on e very well. This is shown in Figure 
ISl where we have collected the coefficients ai and 02 for all moments considered. Figure M 
shows that the e dependence is correctly captured by the function ng\n{ng/e): 02 becomes 
basically independent of e for small enough e. Moreover, the prefactors a2 of all moments 
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considered converge to practically the same value once e is small enough, underlining once 
more the very similar convergence behavior of the three moments with k = 2,4, 8. The 
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FIG. 8: (Color online) The coefficients Oi (blue full lines) and a2 (red dashed lines) for the conver- 
gence of Z)^2 (circles), D^^ (squares), and (diamonds) as a function of the available values of 
ln(l/e). 

apparent slight sub-linear behavior of n*(nq) for and -D^g finds its explanation in the 
fact that the saturation levels of our numerical data for and D^g are higher than for 
D^^ and Dp, such that the possible values we can choose for e are closer to saturation than 
in Dfj_2. This slightly overestimates n*{nq), but less so for large rig, where the approach to 
saturation is slower, such that the curve n*{ng) appears to curve downwards. 

One might wonder if the slight negative curvature may not result from averaging over 
the column of the matrix. Indeed, while in CUE all matrix elements are equivalent in the 
sense that {\Uij\'^'') is independent of and additionally averaging over a column would 
therefore give exactly the same result, this is not the case in a UCE circuit of given finite 
length Ug. For example, after one gate, the first element Uu, (where the index 1 signifies 
the state |0 . . . 0) in the computational basis) is never affected by a CNOT, whereas others 
are. One effect of the convergence of UCE to CUE is that these inhomogeneities decay. One 
might suspect that averaging over the first column effectively reduces the inhomogeneities 
and could therefore provide a mechanism of accelerated convergence compared to a moment 
that has not been averaged over a column of U. As the sample size (in the sense of the 
number of elements in a column used to average) increases exponentially with Ug, small 
differences in the (l^^ijP^) are rapidly averaged out, and this might suggest a more rapid 
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convergence than for a single matrix element. Moreover, the effect is expected to become 
more pronounced for higher moments, which amplify small initial differences. 

To test this hypothesis we calculated for restricted sample sizes {ug < 10 and = 10'*) 
the forth and eighth moments fi'^ and /i'g for a fixed matrix element (we chose Uu and 
U31), defined as in Q but without averaging over the first column. For larger values of n^, a 
calculation that does not use averaging over a column is unfortunately beyond our numerical 
capacities. The corresponding signals D'^^ and D'^^ for the element Uu start off at a larger 
value than and D^^, and decay more rapidly, till the latter are reached. However, this 
happens at rather small values of Ug, whereas for larger Ug, the two curves D'^^ and D^^^ for 
the same k are basically indistinguishable within the precision of the data. Thus, averaging 
over a column does not significantly change n*{nq). 

On the other hand, we verified that also in and in Dp a slight negative curvature of 
n*{ng) can be produced by pushing e close to the saturation level. Furthermore, the quality 
of the fits to /i and /2 deteriorates for decreasing e. From a physical perspective a sublinear 
behavior seems impossible, as it would mean that the global state of a large enough quantum 
circuit equilibrates before even every qubit is touched by a quantum gate. All these elements 
confirm the explanation of the slight negative curvature as numerical artifact as discussed 
above. 

The main messages from Figs. [7] and Figs. [8] is that 1.) all moments considered converge 
at basically the same rates; 2.) the number of gates needed to achieve a given precision 
increases in good approximation linearly with the number of qubits, and 3.) the additional 
e dependence is well accounted for by a nqln{nq/e) behavior of n*{nq). 

C. Correlations between matrix elements 

Even in CUE, different matrix elements are not independently distributed (in contrast 
to cue's Hermitian cousin CUE). One obvious reason for the correlations is the ortho- 
normalization of columns and lines of a unitary matrix. We define correlations between k 
different i/ij for a same column j as Ck = (IliLi Vij) — ^^{Y[i=i l^uP); where the average is 



over the considered ensemble. In the CUE case, one finds through invariant integration [2 




(iV-1)! 



(10) 
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which differs from /i^ by a factor ^. Thus, for small k, the correlations are important, and 
comparable to the moments of the same order. For UCE we again average over the column, 
but include each element in at most one product in order not to create additional artificial 
correlations between the products, 

= ;rw] S 5^ n yt\~k+j)i ■ (11) 

'^rlki r=l i=l j=l 

Here, [x] means the integer part of x. We measure the distance to CUE as a relative deviation 
of Cfc from the CUE value. 

D., = M^i^. (12) 

Ck 

Fig. [9] shows results for the evolution of n*{nq) in the cases k = 2, k = 4, and /c = 8, as 
well as fits to fi. The behavior is predominantly linear, and very similar for all moments 
considered. The numerical data can also be fitted very well to /2, and again it is difficult 
within the limited range of Ug values available to us to clearly distinguish between one or the 
other. The function /2 fits in general somewhat worse (plot not shown) than /i, but this is 
due to the same numerical artifact of slight negative curvatures of n*{nq). Nevertheless, from 
Fig. [To] which shows the fitted coefficients ai and 02 for all three correlators, it is clear that 
the e dependence is correctly described by nq\n{nq/e), and that the prefactor 02 is largely 
independent of the order of the correlator for sufficiently small e. Moreover, comparing 




FIG. 9: (Color online) The number of gates n* needed to achieve convergence of the correlators Cfc, 
Dcfe < e for A; = 2,4, 8 together with fits to the functions /i (plots 1,2,3, respectively). For k = 2, 
ln(e) =-1, -2, -3, and -4 (A, o, □, o); for k = A, ln(e) =0,-1,-2 and -3 (A, o, □, o); and for k = 8, 
ln(e) =1, and -1 (o, □, o, respectively). 

Figs. (TU] and [H], we see that the correlations Ck converge basically with the same rates as 
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FIG. 10: (Color online) The coefficients ai (blue full lines) and 02 (red dashed lines) for the 
convergence of Dc^ (circles), (squares), and Dc^ (diamonds) as a function of the available 
values of ln(l/e). 

the moments of the same order, — a result to be expected from the theory of /c-designs 
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22l |. In fact, an alternative definition of a unitary fc-design is that any polynomial in 



the complex matrix elements of degree (m, /) with m,l < k has the same average over the 



unitary design as over the full unitary group 



III. CONCLUSION 



2l| 



We have studied the convergence of the distribution of matrix elements, moments of 
that distribution up to {\U.ik\^^), as well as correlations between matrix elements with up to 
16 factors \Uik\ within the same column, for random quantum algorithms drawn from the 
Unitary Circuit Ensemble (UCE) to their counterparts in CUE. Simulating quantum circuits 
with up to 28 qubits (for the distribution of matrix elements), and up to 18 (15) qubits 
for the moments (correlations), we have shown that all these quantities can be efficiently 
reproduced with a precision e using quantum circuits from UCE containing a number of gates 
that scales at most as n* < Cng\n{ng/e) with the number of qubits Ug, where C is a positive 
constant. Such fast convergence comes somewhat to a surprise, as for general two-copy 
gapped gate sets with a gap 1 > A > a quadratic upper bound, n* < Cuqiriq + ln(l/e)). 



has been shown j22| (UCE has spectral gap A ^ 0.232703). While it is clear that in order 
to faithfully reproduce the full joint probability distribution of CUE using UCE circuits one 
needs a number of gates which scales exponentially with the number of qubits, our results 
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suggest that the inefficiently reproduced quantities must be of more complex nature than the 
low moments of the distribution of absolute values of matrix elements and their low-order 
correlation functions. 
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